Goto

Collaborating Authors

 Gary


Instantiation-based Formalization of Logical Reasoning Tasks using Language Models and Logical Solvers

arXiv.org Artificial Intelligence

Robustness of reasoning remains a significant challenge for large language models, and addressing it is essential for the practical applicability of AI-driven reasoning systems. We introduce Semantic Self-Verification (SSV), a novel approach that addresses the key challenge in combining language models with the rigor of logical solvers: to accurately formulate the reasoning problem from natural language to the formal language of the solver. SSV uses a consistency-based approach to produce strong abstract formalizations of problems using concrete instantiations that are generated by the model and verified by the solver. In addition to significantly advancing the overall reasoning accuracy over the state-of-the-art, a key novelty that this approach presents is a feature of verification that has near-perfect precision over a significant coverage of cases, as we demonstrate on open reasoning benchmarks. We propose such *near-certain reasoning* as a new approach to reduce the need for manual verification in many cases, taking us closer to more dependable and autonomous AI reasoning systems.


Think-then-Act: A Dual-Angle Evaluated Retrieval-Augmented Generation

arXiv.org Artificial Intelligence

Despite their impressive capabilities, large language models (LLMs) often face challenges such as temporal misalignment and generating hallucinatory content. Enhancing LLMs with retrieval mechanisms to fetch relevant information from external sources offers a promising solution. Inspired by the proverb "Think twice before you act," we propose a dual-angle evaluated retrieval-augmented generation framework \textit{Think-then-Act}. Unlike previous approaches that indiscriminately rewrite queries or perform retrieval regardless of necessity, or generate temporary responses before deciding on additional retrieval, which increases model generation costs, our framework employs a two-phase process: (i) assessing the input query for clarity and completeness to determine if rewriting is necessary; and (ii) evaluating the model's capability to answer the query and deciding if additional retrieval is needed. Experimental results on five datasets show that the \textit{Think-then-Act} framework significantly improves performance. Our framework demonstrates notable improvements in accuracy and efficiency compared to existing baselines and performs well in both English and non-English contexts. Ablation studies validate the optimal model confidence threshold, highlighting the resource optimization benefits of our approach.


The Mail

The New Yorker

Andrew Marantz's appraisal of two Silicon Valley camps that hold conflicting ideas about A.I.'s development--"doomers," who think it may spell disaster, and "effective accelerationists," who believe it will bring unprecedented abundance--offers a fascinating look at the factions that have dominated the recent discourse ("O.K., Doomer," March 18th). But readers should know that these two vocal cliques do not speak for the entire industry. Many in the A.I. and machine-learning worlds are working to advance technological progress safely, and do not suggest (or, for that matter, believe) that A.I. is going to lead society to either utopia or apocalypse. These people include A.I. ethicists, who seek to mitigate harm that A.I. has caused or is poised to inflict. Ethicists focus on concrete technical problems, such as trying to create metrics to better define and evaluate fairness in a broad range of machine-learning tasks.


Question Decomposition Improves the Faithfulness of Model-Generated Reasoning

arXiv.org Artificial Intelligence

As large language models (LLMs) perform more difficult tasks, it becomes harder to verify the correctness and safety of their behavior. One approach to help with this issue is to prompt LLMs to externalize their reasoning, e.g., by having them generate step-by-step reasoning as they answer a question (Chain-of-Thought; CoT). The reasoning may enable us to check the process that models use to perform tasks. However, this approach relies on the stated reasoning faithfully reflecting the model's actual reasoning, which is not always the case. To improve over the faithfulness of CoT reasoning, we have models generate reasoning by decomposing questions into subquestions. Decomposition-based methods achieve strong performance on question-answering tasks, sometimes approaching that of CoT while improving the faithfulness of the model's stated reasoning on several recently-proposed metrics. By forcing the model to answer simpler subquestions in separate contexts, we greatly increase the faithfulness of model-generated reasoning over CoT, while still achieving some of the performance gains of CoT. Our results show it is possible to improve the faithfulness of model-generated reasoning; continued improvements may lead to reasoning that enables us to verify the correctness and safety of LLM behavior.


Deep Active Alignment of Knowledge Graph Entities and Schemata

arXiv.org Artificial Intelligence

Knowledge graphs (KGs) store rich facts about the real world. In this paper, we study KG alignment, which aims to find alignment between not only entities but also relations and classes in different KGs. Alignment at the entity level can cross-fertilize alignment at the schema level. We propose a new KG alignment approach, called DAAKG, based on deep learning and active learning. With deep learning, it learns the embeddings of entities, relations and classes, and jointly aligns them in a semi-supervised manner. With active learning, it estimates how likely an entity, relation or class pair can be inferred, and selects the best batch for human labeling. We design two approximation algorithms for efficient solution to batch selection. Our experiments on benchmark datasets show the superior accuracy and generalization of DAAKG and validate the effectiveness of all its modules.


Opinion

#artificialintelligence

For a few years, I've been trying to write a story about a cat. A.I. will not be able to write this, partly because the story is still inside my imagination and on a few rough pages that were originally drafted in Boston, on sheets of notebook paper, as I sat in my daughter's apartment on a hot summer day. If I have it published (who knows, it's a strange story), perhaps some machine will suck it into a system, break down my style, my usage, the themes I like to touch upon -- loss and despair, love and hope -- wide-ranging themes that, like all themes, arrive out of my own unique human concerns and have fueled me through six story collections. But for now, this story I haven't yet finished is inside my imagination, safe and sound, and no machine can make it or conjure it because no machine has been in my head as I wandered the streets of South Chicago, or stared at Lake Michigan from Promontory Point on the particular day I was there in June, or stopped in the parking lot of a supermarket called Treasure Island to examine a pile of snow, left over from a long winter, honeycombed and covered with dirt and grime, which is the image that closes the rough draft of my story; no machine stood with me in front of the Obama house, on the corner of 1118 Hyde Park Boulevard, and watched a Secret Service agent as he approached, another image that sparked the plot of my story, and certainly no machine was with me watching a cat named Baudelaire, my daughter's cat, as he played on a particular Chicago afternoon, in a particular moment years ago, clutching a piece of string -- yet another image that spoke to me through the retrospect of memory. No machine -- and I use that phrase because A.I. is a machine, and no matter how complicated, or even organic, its still-binary, open-and-shut gates may be -- looked through my eyes as I took the train to my hometown in Michigan, gazing out over the old steel mills of Gary, Ind., making note of images with intent, storing and twisting them in relation to the pain I felt that moment, riding back to my hometown in Michigan, to my father's interment ceremony, an experience that reminded me that I, too, will die someday, and the art I create will be all I leave behind.


Follow the Timeline! Generating Abstractive and Extractive Timeline Summary in Chronological Order

arXiv.org Artificial Intelligence

Nowadays, time-stamped web documents related to a general news query floods spread throughout the Internet, and timeline summarization targets concisely summarizing the evolution trajectory of events along the timeline. Unlike traditional document summarization, timeline summarization needs to model the time series information of the input events and summarize important events in chronological order. To tackle this challenge, in this paper, we propose a Unified Timeline Summarizer (UTS) that can generate abstractive and extractive timeline summaries in time order. Concretely, in the encoder part, we propose a graph-based event encoder that relates multiple events according to their content dependency and learns a global representation of each event. In the decoder part, to ensure the chronological order of the abstractive summary, we propose to extract the feature of event-level attention in its generation process with sequential information remained and use it to simulate the evolutionary attention of the ground truth summary. The event-level attention can also be used to assist in extracting summary, where the extracted summary also comes in time sequence. We augment the previous Chinese large-scale timeline summarization dataset and collect a new English timeline dataset. Extensive experiments conducted on these datasets and on the out-of-domain Timeline 17 dataset show that UTS achieves state-of-the-art performance in terms of both automatic and human evaluations.


Meaning without reference in large language models

arXiv.org Artificial Intelligence

The widespread success of large language models (LLMs) has been met with skepticism that they possess anything like human concepts or meanings. Contrary to claims that LLMs possess no meaning whatsoever, we argue that they likely capture important aspects of meaning, and moreover work in a way that approximates a compelling account of human cognition in which meaning arises from conceptual role. Because conceptual role is defined by the relationships between internal representational states, meaning cannot be determined from a model's architecture, training data, or objective function, but only by examination of how its internal states relate to each other. This approach may clarify why and how LLMs are so successful and suggest how they can be made more human-like.


DeepI2P: Image-to-Point Cloud Registration via Deep Classification

arXiv.org Artificial Intelligence

This paper presents DeepI2P: a novel approach for cross-modality registration between an image and a point cloud. Given an image (e.g. from a rgb-camera) and a general point cloud (e.g. from a 3D Lidar scanner) captured at different locations in the same scene, our method estimates the relative rigid transformation between the coordinate frames of the camera and Lidar. Learning common feature descriptors to establish correspondences for the registration is inherently challenging due to the lack of appearance and geometric correlations across the two modalities. We circumvent the difficulty by converting the registration problem into a classification and inverse camera projection optimization problem. A classification neural network is designed to label whether the projection of each point in the point cloud is within or beyond the camera frustum. These labeled points are subsequently passed into a novel inverse camera projection solver to estimate the relative pose. Extensive experimental results on Oxford Robotcar and KITTI datasets demonstrate the feasibility of our approach. Our source code is available at https://github.com/lijx10/DeepI2P


Serial killer algorithm lets anyone analyse crimes

Daily Mail - Science & tech

A non-profit group in the US is keeping tabs on unsolved murders across the country – and their software can spot unnoticed trends to reveal a possible serial killer. Known as the Murder Accountability Project, the open-source website uses data from federal, state, and local governments. They've so far compiled details on more than 600,000 homicides, and as it's available for free online, anyone can access the data to look for connections in unsolved cases. A non-profit group in the US is keeping tabs on unsolved murders across the country – and their software can spot unnoticed trends to reveal a possible serial killer. Known as the Murder Accountability Project, it uses data from federal, state, and local governments.